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Abstract 

In this paper, we analyze the time complexity of finding regular poly- 
gons in a set of n points. We combine two different approaches to find 
regular polygons, depending on their number of edges. Our result depends 
on the parameter a, which has been used to bound the maximum number 
of isosceles triangles that can be formed by n points. This bound has been 
expressed as 0{n 2+2a+e ), and the current best value for alpha is 0.068. 

Our algorithm finds polygons with 0(n a ) edges by sweeping a line 
through the set of points, while larger polygons are found by random sam- 
pling. We can find all regular polygons with high probability in 0(n 2+a+e ) 
expected time for every positive e. This compares well to the 0(n 2+2a+E ) 
deterministic algorithm of Brafi [T]. 



1 Introduction 

The focus of this study is on the detection of regular structure in point sets. 
Our motivation comes from observations that have been published concerning 
extraordinary symmetries in the placements of ancient towns, temples and other 
important locations. For instance, the oracle of Delphi has been measured to be 
the apex of isosceles triangles with at least seven pairs of ancient Greek cities^ 
The same is true for the oracle at Dodoni, while the small island of Delos is the 
apex of at least thirteen isosceles triangles. All three of the central locations 
were considered to be among the most important of places, and in fact Delphi 
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Figure 1: Can you find all regular fc-gons in this figure? Solution is in Figure [2] 

was considered to be the navel of the world. In general, one may find seemingly 
countless cases of collinearity, reflective symmetry, partial n-gons and networks 
of isosceles triangles when looking at the graph of cities in the ancient world, 
from the British Isles to the Middle East. 

We will not concern ourselves further questioning whether such structures 
were carefully constructed or instead an expected result on large complete geo- 
metric graphs. However, the topic generates other interesting questions. If one 
chooses a particular location as a temple, it is not difficult to construct cities 
(at least on paper) so that the temple becomes the center of several symme- 
tries. What about the opposite? Given a set of existing cities, where should 
one decide to place a temple? Or, to ask differently, where should one look for 
a hidden temple? 

2 Related Work and Statement of New Result 

Given a set of n points, we wish to find the maximum subset which satisfies a 
specific symmetry or structure. 

The algorithm by Brafi [Q, for finding maximum symmetric subsets in point 
sets, is capable of handling reflective lines, translations, rotational symmetries 
and repeated sets. The time complexity is 0(n 2136+e ) for every positive e. 

The bound of Brafi depends on the maximum number of isosceles triangles 
formed by a set of points in the plane. Pach and Agarwal [4] bound this value 

by 0(n 2+1 / 3 ). This was improved by Pach and Tardos [5] to 0(n 5e - 1 +c ) ~ 

0(n 2.136+ £) _ 

Brafi noted that another result of his algorithm was to find all regular poly- 
gons contained in the set. We improve the time complexity for detecting regular 
polygons in point sets, although unlike the algorithm by Brafi, our algorithm is 
randomized. 

Our bound is 0(n 2+a+e ), where a~0.068. Notice that a, the fractional 



Figure 2: All regular fc-gons in the point set illustrated in Figure [T] 



component in the exponent of n, is half of the equivalent component in [T]. 
This is no coincidence. Our algorithm is designed to reduce this fraction by a 
factor of 2. Thus, any improvement of the result of Pach and Tardos in [5] will 
be directly reflected in our algorithm. 

3 Model of Computation 

We assume that all coordinates and other values are stored in a format that 
allows constant time equality testing and hashing. As hashing is only used to 
speed up one dimensional searches, it can be substituted with a comparison- 
based structure. This increases the computational complexity by a logarithmic 
factor, which is absorbed into the e. Furthermore, as exact computation meth- 
ods are typically not used, comparison based structures can be used to substitute 
equality tests with proximity tests for suitably small proximity values. This will 
compensate for any small discrepancies in the computation. 

Note that we frequently use the variable e inside asymptotic notation. Such 
claims hold for any constant e > 0, and the asymptotic notation may hide 
constants that depend on e. Thus, for example, /(n) = 0(rf log n) if and only 
if f(n) = 0(n £ ). 

4 Results 

We state our main theorem immediately. 

Theorem 1. There is an algorithm which, with high probability, finds all poly- 
gons in a set of n points, in expected time 0(n 2+a+e ). 

We obtain our result by considering small and large fc-gons separately, ob- 
taining the same bound in both cases. Thus we have two separate proofs given 
in this section. For k < n a , we give a sweep- line algorithm in section |4.1| 
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(Lemma Larger polygons are found by the method given in section 4.2 
(Theorem |2|. Let regular polygons of size greater than n a be denoted as >n a - 
gons. Smaller polygons are denoted as <n Q -gons. 



4.1 Small fc-gons 

Lemma 1. For any a, we can find all <n a -gons formed by n points, in 
0{n 2+a \ogn) time. 

Proof. Let S be the given set of n points. We compute all line segments defined 
by pairs of points in S, and we view this as an embedded graph. For each vertex, 
we construct a hash table containing incident edges, stored by key value and 
length. 

Let (pi = 7r — and let $ = {03, 04, . . . 4> n <*} be the set of all angles formed 
by three adjacent vertices in a regular <n Q -gon. Let e=ui«2 be an edge in the 
embedded graph. We will show how to determine if any neighboring edge of e 
might be in the same <n Q -gon as e, in 0(n a ) time. To do this, we use $ and 
the hash tables of vi and t>2- 

The algorithm uses a simple left-to- right line sweep (see Figure |3j. As 
we sweep, we propagate two types of messages along edges: "possible fc-gon 
above/below". Edges may carry several messages at the same time. During the 
sweep, we stop at each vertex v and process the following types of events: 

• Origination Event. This occurs when v might be the leftmost vertex of 
a fc-gon. If two edges of the same length are to the right of v and are at 
angle <fik, then we give a "possible fc-gon below" signal to the upper edge 
and a "possible fc-gon above" signal to the lower edge. 

• Propagation Event. This event confirms or rejects a scenario in which v is 
a possible vertex of a fc-gon (not the leftmost or rightmost). The event is 
triggered when a "possible fc-gon above/below" signal is received from an 
edge e incident and to the left of v. If there is a right-facing edge r, of the 
same length and forming an angle <fik with e, the signal is propagated to 
r. Naturally, the orientation of the angle must match the type of signal 
(above or below). If there is no such right-facing edge, the signal (and 
candidate fc-gon) is discarded. 

• Termination Event. This event detects the rightmost vertex of a fc-gon. 
This happens if v receives a "possible fc-gon above" and a "possible fc-gon" 
below signal from two left-facing edges of the same length, angle cpk, and 
in the proper relative orientation. We output the description of the fc- 
gon (the center, rotation, and value fc can be easily determined from the 
information at hand), and we do not propagate the two incoming signals. 

The sweep exhaustively scans every possible <rt a -gon. If such a polygon 
exists, the origination event will identify its leftmost vertex. Propagation will 
verify that the correct angles and edge lengths exist throughout the polygon, 
and termination will match the top and bottom chains of the polygon. 



4 




Figure 3: Handling small fc-gons. Left: propagation; Right: termination. 



It takes 0(n 2 logn) time to perform a line sweep. (The astute reader will 
realize that a topological sweep [2] could be used instead, at a cost of 0(n 2 ), 
but as all logarithmic factors get absorbed in the e, this is not critical). There 
are at most 2n a possible signal types, two for each polygon size. If all signal 
types were to appear on each of the 0(n 2 ) edges, the total number of signals 
over the entire sweep would be 0(n 2+a ). Propagation and termination events 
are done with table lookup and take constant time for each event. Generating 
all origination events takes 0(n a ) table lookups for each incident edge to the 
right, as all angles in $ are searched. Thus, the total runtime is 0(n 2+a ). □ 



4.2 Large fc-gons 

In this section we show how to identify all >n Q -gons in S with high probability. 

Each such polygon will be quickly identified by its own special triangle. In 
fact we show that each polygon is associated with several triangles that could 
uniquely identify it. We sample triples of points in such a way that, with high 
probability, at least one special triangle will be chosen for each existing fc-gon. 
Our algorithm can be summarized by the following: 

• Randomly select "enough" isosceles triangles from the point set. 

• For each isosceles triangle, determine if it is such that it could uniquely 
identify a fc-gon (i.e. if it is special). 

• For every special triangle, test if the vertices of its unique matching fc-gon 
exist. 

In the remainder of this section, we describe how to select random isosceles 



triangles (see 4.2.1 1, we describe special triangles and their properties (see 4.2.2 1, 



and we explain how to test for all fc-gons given a sufficient set of special triangles 



(4.2.31. Finally, we piece these elements together and describe the entire process 



that gives our result (4.2.41 
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4.2.1 Random selection of isosceles triangles 



We start by explaining how to select a random apex of an isosceles triangle from 
the set of all apices of isosceles triangles formed by a point set. Note that we 
wish to do this without explicitly constructing all isosceles triangles. 

Lemma 2. Let S be a set of n points. Three ordered points p,q,r £ S form an 
isosceles triple if \pq\ — \pf\. With 0(n 2 ) preprocessing, an isosceles triple can 
be selected uniformly at random from the set of all isosceles triples, in constant 
time. 

Proof. Let G be the complete geometric graph on S. Let e(p, £) be the set 
of segments of length £, incident to p: e(p,£) = {pq£G | |p§|=/}. Let B — 
{e(p,£) I |e(p,£)|>2}. Let / be the set of all isosceles triples. Note that 

I I — 2-,e{p,l)eB Z \ 2 )■ 

Using bucketing, we can compute e(p, £) (and \e(p, £)\) for all p and £ in 
0(n 2 ) time. This can be done as follows. For each {p, q}^S, pj^q, add the edge 
pq to the bucket with label e(p, \pq\), creating the bucket if it does not exist. 
The set E, of at most (™) buckets, is obtained by deleting all buckets with only 

pi) 2 

one edge. Then, for each bucket b £ E assign a weight of w(b) = — v ^ |c| . a . 
Note that J^beB w (b) = 1- 

Now, by selecting a random bucket b(p, £) £ B, as well as two different 
random edges pq,pf £ b(p,£), we obtain a random isosceles triple (p,q,r). This 
can be done with three random choices in time O(l). The two edges pq and pf 
both have length £ by virtue of being in e(p,£). Thus (p,q,r) is guaranteed to 
be an isosceles triple. 

Given an isosceles triple (p, q,r), pq and pf will appear in exactly one bucket 
b(p,pq — pf) and the probability of picking this triple will be 



Pick a bucket 



ScGB (2) 




Since |/| is the number of isosceles triples this method chooses each triple uni- 
formly at random. Our claim follows, since every isosceles apex corresponds to 
exactly two ordered triples. □ 



4.2.2 Special triangles: characterization and properties 

Any special triangle T is particular type of isosceles triangle with vertices be- 
longing to S. What makes T special is that its vertices belong to a (potentially 
incomplete) regular /c-gon of S, and furthermore the number of fc-gon vertex po- 
sitions skipped by the non-base sides of T is relatively prime to k (see Figure 13}. 
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Figure 4: Illustration of special triangles. The isosceles triangle with apex angle 
a is special, since it fits in the pentagon on the left, and satisfies the requirement 
of being relatively prime (the ratio is 2/5). The same triangle does not satisfy 
this requirement in the 15-gon in the middle (ratio: 6/15). On the right we 
illustrate the special triangles for the 15-gon (1,2,4,7 / 15). 

In this section we give bounds on the probability that a triangle is special. We 
also explain how a special triangle is uniquely associated to one regular fc-gon. 
Let p(k-Ly) denote the probability that fc is relatively prime to y. 

Lemma 3. Let k be an integer satisfying 3<k<n, and let y be an integer chosen 
uniformly at random in the interval [1, |~| — 1]]. Then p(k±.y) — 0( lo ^ w ). 

Proof. We know from the prime number theorem (Gauss, 1792; see [3]) that the 
number of primes smaller than any given integer x is @(jj|-j)- Thus there are 

®(logfe) P rmies m the range [y/k, — 1] ) - Any such prime y satisfies k±y. The 
probability that some y is chosen uniformly at random is ©(j^g) = ^(n^)- O 

Lemma [3] tells us that in a regular fc-gon, among all isosceles triangles with 
a given apex, the probability of randomly choosing a special triangle is ^(j^^)- 
The following Lemma confirms that we can quickly associate a given special 
triangle to its unique fc-gon. 

Lemma 4. If a regular k-gon has a special triangle T, then T is not special for 
any other regular k'-gon. 

Proof. Suppose that each non-apex side of T skips d vertices of the fc-gon. Let 
9 be the apex angle of T. This angle can be viewed as a function of r, 

If T can be embedded on some other fc'-gon, where d' vertices are skipped, 
the ratio p must also equal 9. Since T is special for the fc-gon, we know that 

k±d. Therefore for any other regular fc'-gon, we have 37 = fg, where c > 1. 
This means that T does not satisfy the property of being relatively prime for 
other regular polygons. □ 

Lemma 5. Let T be chosen uniformly at random from the set of all isosceles 
triangles formed by points of a regular k-gon, where k is not given as input. 
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After 0{n 2 ) preprocessing, we can determine the value k in 0(1) time with 



Proof. In 0(n 2 ) time, we can construct a table containing the angle of every 
special triangle for every regular fc-gon, for k<n. This is done as follows. For an 
isosceles triangle present in a fc-gon, let the isosceles ratio be the side length of 
the triangle divided by k. For every fc-gon, we can construct a list of all isosce- 
les ratios, in O(fc) time. The total time is quadratic over all values of k < n. 
Then, we eliminate items with the same ratio, except for the one created by an 
irreducible fraction. 

By Lemma [4] the elements of the list are unique. Thus, if we are given a 
special triangle, we can use binary search (or hashing for O(l)-time) to locate 
the fc-gon for which the triangle is special. 

Now suppose that each non-apex side of T skips d vertices of the fc-gon. 
Clearly d is in the range [1, ["§ - 1]]. By Lemma|i] p(k±d) = ^(j^)- □ 

Lemma 6. For any >n a -gon G, at least ^ of the isosceles triangles formed by 
vertices of S are in G. 

Proof. According to [5j, the number of isosceles triangles among n points in the 
plane is 0(n 2+2a+e ). Any >n"-gon G defines Q(n 2a ) isosceles triangles. Thus, 
at least 1/n 2 of the isosceles triangles from S have all three points in G. □ 

Lemmas [3] and [6] directly imply the following. 

Corollary 1. For any >n a -gon G, n2 \ og - of the isosceles triangles are special 
triangles of S associated with G. 

4.2.3 Using special triangles to test the existence of regular polygons 

We have already established that a special triangle is associated to a unique fc- 
gon, but this does not mean that the fc-gon actually exists in S. In this section 
we show that the total cost of such verifications is expected to be on the order 
of the total complexity of fc-gons in a set. In other words, false verifications do 
not cost too much. 

Lemma 7. The sum of complexities of all >n a -gons is at most n 2+a+e . 

Proof. Let m be the number of i-gons in a fixed set S of n points. The sum of 
the complexities of all >n Q -gons is J2i=\n a ] A n v fc-gon generates at most 
0(fc 2 ) isosceles triangles of which at most 0(^^) are special. Thus there are 

at most X)™=[n Q ] fe^f special triangles. We know from [5] that there are at most 
0(n 2+2a+e ) distinct isosceles triangles. Since special triangles are distinct, 



probability ^(i^). 



n 



;2 




8 



which since n a <i<n gives 

a n 

and dividing and absorbing the log into the e gives 

n 

tin = 0{n 2+a+e ) 

This last equation is exactly the statement of the lemma. □ 

Definition 1. If a k-gon has at least k/2 vertices in S, then it is more-than- 
half-full. Otherwise it is less-than-half-full. 

Corollary 2. The sum of complexities of all more-than-half-full >n a -gons in 
a set S of n points is at most n 2+a+t . 

Proof. This is an easy variant of Lemma [7] as the more-than-half-full condition 
only affects constants in the asymptotic notation. □ 

Lemma 8. Let T be a special triangle, associated to a k-gon P. We can decide 
if P is less-than-half-full in S in O(l) expected time. Otherwise if it is more- 
than-half-full, we decide if it is completely full in 0{k)-time. 

Proof. The center of P is found from the circumcenter of T in constant time. 

Given T, the center of the polygon, and the value of k determined by 
Lemma [5] we can compute the location of any vertex of P in constant time. 
We begin checking the k— 3 unconfirmed vertices in random order, without re- 
placement. If any vertex is not present we terminate the procedure. Otherwise 
we output the candidate fc-gon. 

Checking an entire /c-gon takes O(k) time, so if P is more-than-half-full, our 
claim trivially holds. If P is less-than-half-full, over half of the k— 3 tests will 
fail. Since the tests are ordered randomly, the expected number of tests is at 
most 2. Thus we expect to spend only O(l) time in this case. □ 

4.2.4 The algorithm 

So far, we have shown that we can identify every large regular polygon in a 
point set, if we manage to find a special triangle belonging to each such polygon. 
We have hinted that these special triangles will be found by selecting "enough" 
random isosceles triangles. The following theorem addresses this issue and states 
our main result for large regular polygons. 

Theorem 2. With high probability, we can find all >n a -gons in a set S of n 
points in the plane in expected time 0(n 2+a+e ). 
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Proof. The Coupon Collector problem tells us that if we randomly select items 
out of a set of n elements, with replacement, we should expect to have observed 
all n items after 0(n\ogn) selections. This also means that if we have n disjoint 
sets of elements, and the probability of selecting something from each set is equal 
(i.e. 1/n), then we expect to have obtained a sample from each set in 0(n log n) 
time. More generally, if the minimum probability of sampling from one of the 
disjoint sets is 1/p, then we expect to have sampled from all sets in 0(plogp) 
time. 

By Corollaryjlj - of all isosceles triangles in S are special, for each >n a - 

gon. So we set this as p — n 2 log n to conclude that we expect to have sampled a 
special triangle for each such polygon in 0(n 2 log n-log(n 2 log n)) — 0(n 2 log 2 n) 
time. 

We select 0(n 2 log 2 n) random isosceles triangles formed from the vertices of 
S. So, with constant probability we will obtain at least one special triangle for 
every >n Q -gon. With quadratic pre-processing time, we can select each random 
isosceles triangle in constant time (Lemma [2]). 

By Lemma [5] in constant time we can determine if one of our random isosce- 
les triangles is special, and if so we can determine the position of its unique 
candidate /c-gon. Lemma [8] explains how we perform a verification for each spe- 
cial triangle (or, in other words, for each potential fc-gon). We expect to spend 
O(l) time for each less-than-half-full fc-gon, which means 0(n 2 log 2 n) time for 
all such cases. The cost of verification for more-than-half-full /c-gons is propor- 
tional to their size. We can avoid verifying the same fc-gon twice by constructing 
a hash table, using the center and top vertex coordinate as keys. Corollary [2] 
states that the total size of all more-than-half-full >n Q -gons is 0(n 2+a+e ). □ 

5 Combinatorial questions 

A purely combinatorial question is the following: what is the maximum com- 
plexity of all of the regular polygons in a set of n points in the plane. Lemma [7] 
bounds the complexity with respect to regular >n"-gons to be 0(n 2+a+e ). Triv- 
ially, the complexity of all regular <n 7 -gons is C>(n 2+7 ), for any 7. Thus, the 
total complexity of all regular polygons is 0(n 2+a+e ). This bound is highly 
dependent on the number of isosceles triangles, but perhaps a tighter bound is 
possible by using more properties of regular polygons, instead of simply treating 
them as generators of isosceles triangles. 

We note that an improvement of the bound of the total complexity of regular 
polygons from 0(n 2+a+e ) to 0(n 2+ S a+c ) would give a corresponding speedup in 
our algorithm (by increasing the number of random triangles and reducing the 
cutoff between small and large polygons). This is because the current bottleneck 
in our algorithm is the last step, the successful verification for special triangles. 

Brafi observed that one can construct a set of points with c^n 2 regular k- 
gons, for any constant k, which gives a lower bound Q,{n 2 ) for the number of 
regular polygons in a point set. An upper bound on the number of regular 
polygons is 0(n 2 logn). Every pair of points defines at most two fc-gons, thus 
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9 2 

there are at most /c-gons. The total number of regular polygons is therefore 

at most J2k=3 = 0(n 2 logn). 

While the gap between the lower and upper bounds is much larger for the 
complexity of polygons rather than the number of regular polygons, it remains 
unknown if these two quantities differ asymptotically. 
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